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ABSTRACT 


We  present  an  algorithm,  inspired  by  self-organization  and  stigmergy  observed 
in  biological  swarms,  for  managing  multiple  sensors  tracking  large  numbers  of  tar¬ 
gets.  We  devise  a  decentralized  architecture  wherein  autonomous  sensors  manage 
their  own  data  collection  resources  and  task  themselves.  Sensors  cannot  commu¬ 
nicate  with  each  other  directly;  however,  a  global  track  file,  which  is  continuously 
broadcast,  allows  the  sensors  to  infer  their  contributions  to  the  global  estimation  of 
target  states.  Sensors  can  transmit  their  data  (either  as  raw  measurements  or  some 
compressed  format)  only  to  a  central  processor  where  their  data  are  combined  to 
update  the  global  track  hie.  We  outline  information-theoretic  rules  for  the  general 
rnultiple-sensor  Bayesian  target  tracking  problem.  We  provide  specific  formulas  for 
problems  dominated  by  additive  white  Gaussian  noise.  Using  Cramer  Hao  lower 
bounds  as  surrogates  for  error  covariances,  we  illustrate,  using  numerical  scenar¬ 
ios  involving  ballistic  targets,  that  the  bioinspired  algorithm  is  highh  scalable  and 
performs  very  well  for  large  numbers  of  targets. 
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1.  INTRODUCTION 


A  major  objective  of  multiple-sensor  target  tracking  systems  is  to  achieve  cert  ain  levels  of  state 
estimation  accuracy  for  as  many  targets  as  possible.  In  applications  where  a  single  sensor  is  tasked 
with  collecting  measurements  on  a  single  target,  the  management  of  the  sensor's  available  resources 
does  not  pose  any  challenge.  However,  in  situations  where  a  small  number  of  sensors  are  directed 
to  collect  measurements  on  a  large  number  of  targets,  the  decision  proc  ess  that  determines  which 
sensors  are  to  collect  measurements  on  which  targets  during  the  next  observation  time  interval  can 
be  daunting.  The  sensor  management  problem  becomes  more  difficult  as  the  number  of  targets 
relative  to  the  number  of  sensors  increases.  The  problem  further  exacerbated  when  targets  and 
sensors  are  not  arranged  in  a  favorable  geometric  constellation.  Ideally,  one  wishes  to  “service” 
as  many  targets  as  possible;  that  is.  one  would  like  to  ensure  that  certain  levels  of  accuracy,  or 
“qualities  of  service.”  can  be  achieved  in  estimating  the  states  of  as  many  t  argets  as  possible.  In  this 
report,  we  present  a  target  selection  algorithm  that  is  inspired  by  principles  governing  biological 
swarm  intelligence. 

In  multiple-sensor  target  tracking  systems,  data  collected  by  tasked  sensors  are  often  corre¬ 
lated  and  fused  to  form  a  global  database  of  target  state  estimates  and  error  covariances,  referred 
to  as  the  “global  track  file"  (e.g..  see  1]  and  [2]).  Sensor  data  can  be  processed  either  at  a  cen¬ 
tral  processing  node  or  in  a  distributed  fashion,  giving  rise  to  a  plethora  of*  data  correlation  and 
fusion  architectures  (e.g.,  see  [3]).  In  this  report,  we  pursue  a  rnainlv  decentralized  approach  to 
sensor  management  that  would  be  readily  adaptable  to  a  bioinspired  implementation.  However, 
our  architecture  is  not  devoid  of  central  processing.  In  a  way,  it  can  be  regarded  as  a  hybrid.  In 
onr  implementation,  sensors  are  not  allowed  to  directly  communicate  with  each  other,  yet  a  global 
track  file,  maintained  by  a  central  processor,  is  continuously  broadcast,  and  its  contents  are  there¬ 
fore  assumed  to  be  always  available  to  all  sensors.  The  sensors  operate  locally  and  autonomously 
by  tasking  their  own  resources  based  on  the  information  available  in  the  global  track  file.  In  our 
architecture,  sensors  make  adjustments  to  their  tasking  decisions  based  on  an  assessment  of  their 
anticipated  relative  contributions  to  the  global  track  file  over  a  forthcoming  data  collection  time 
interval;  thus,  the  global  track  file  provides  an  indirect  means  for  sensors  to  communicate  with 
each  other.  Sensors  transmit  their  data  only  to  the  central  processor,  where  they  are  correlated 
and  fused  to  update  the  global  track  file. 

A  decentralized  approach  to  sensor  management  has  many  benefits.  Foremost.,  it  is  robust  in 
that  the  overall  system  performance  is  not  significantly  degraded  by  the  failure  of  a  few  member 
sensors.  More  importantly,  a  decentralized  implementation  is  scalable:  that  is.  the  complexity  of 
the  implementation  does  not  increase  significantly  as  more  sensors  are  added  to  the  system.  Since 
all  tasking  decisions  are  made  locally  by  the  sensors  themselves,  the  computing  and  communication 
bandwidth  requirements  are  less  stringent  than  they  would  be  in  a  purely  centralized  approach. 
Many  examples  of  decentralized  approaches  to  sensor  management  can  be  found  in  the  literature. 
For  example,  in  [4],  the  authors  apply  a  novel  game-theoret  ic  approach  to  dynamic  target  selection. 
Of  relevance  to  our  work  is  [5],  wherein  the  authors  employ  an  information-theoretic  approach  to 
decentralized  sensor  management. 
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Bioinspirecl  approaches  to  solving  combinatorial  optimization  problems  follow  either  or  both 
of  two  main  principles  observed  in  nature:  “self-organization*  and  “stigmergy.”  For  a  full  and  lucid 
account  on  swarm  intelligence  and  its  applicability  to  solving  combinatorial  optimization  problems, 
we  refer  the  reader  to  [6].  There  we  learn  that  in  biological  swarms,  such  as  schools  of  fish  or 
flocks  of  birds,  individual  members,  without  any  awareness  of  actions  taken  by  other  members, 
are  capable  of  unconsciously  arriving  at  a  meaningful,  and  often  very  complex,  global  behavior  by 
merely  following  a  set  of  very  simple  rules.  This  phenomenon  is  referred  to  as  “self-organization” 
or  “emergent  behavior.”  Self-organization  is  not  limited  to  biological  systems  and  is  also  observed 
in  other  complex  decentralized  systems  such  as  free-market  economies.  An  application  of  self- 
organization  to  sensor  management  was  developed  in  [7]  (see  also  [8]).  We  extend  and  formalize 
the  rules  considered  in  [7]  and  8]  using  the  information-theoretic  approaches  employed  in  [5],  [9], 
and  [10]. 

In  biological  swarms,  simple  local  rules  alone  may  not  be  sufficient  to  provide  the  means  for 
the  system  as  a  whole  to  reach  a  particular  global  goal.  Although  individual  members  in  swarms  are 
not  consciously  aware  of  actions  taken  by  other  members,  they  can  nevertheless  communicate  with 
each  other  indirectly  through  the  impact  they  lay  on  their  environment.  As  individuals  impact  their 
environment  through  their  actions,  the  environment,  in  turn,  influences  individual  behavior,  thereby 
fostering  a  learning  mechanism.  This  phenomenon  is  referred  to  as  “stigmergy”  or  “seinatectonic 
communication.”  A  prime  example  in  nature  is  that  of  ant  trails.  Frequently  trodden  paths  to 
existing  food  sources  are  doused  with  pheromone  sprayed  by  passing  ants.  Pheromone  traces,  in 
turn,  encourage  further  traffic.  Once  a  particular  food  source  has  been  exhausted,  ants,  following 
simple  rules,  begin  searching  for  new  sources  of  nourishment.  At  this  point,  the  trail  to  the  empty 
food  source  becomes  less  frequented,  and  extant  pheromone  traces  begin  to  evaporate,  thereby 
discouraging  further  visitation.  Ants  who  find  a  new  food  source  bv  leaving  behind  new  traces  of 
pheromone  then  encourage  other  ants  to  visit  the  new  path,  and  the  process  begins  anew.  The 
application  of  both  self-organization  and  stigniergy  to  sensor  management  was  first  explored  in  [7]. 
By  employing  the  information-theoretic  formulation  of  [10],  we  extend  and  formalize  ideas  originally 
developed  in  [11],  [12],  and  7]  to  allow  sensors  to  communicate  with  each  other  indirectly  via  the 
global  track  file  that  represents  the  “environment”  in  our  application. 

Throughout  this  report,  we  assume  data  association  to  be  perfect.  In  many  real-world  prob¬ 
lems.  poor  data  association  can  be  a  major  limiting  factor  to  the  reliable  performance  of  target 
tracking  systems,  and  its  impact  on  sensor  management  must  therefore  be  brought  to  bear  (e.g., 
see  [1]  and  [2]).  Nevertheless,  even  in  cases  of  perfect  data  association,  sensor  management  can  be 
a  daunting  task  when  the  number  of  sensors  is  significantly  smaller  than  the  number  of  targets. 
Although  absolute  performance  predictions  would  be  optimistic,  we  believe  that  the  formulation 
we  present  in  this  report  can  at  least  shed  light  on  the  relative  performance  of  candidate  target 
selection  algorithms.  We  have  also  ignored  the  impact  of  sensor  biases  that  can  have  an  adverse 
effect  on  data  association.  We  defer  to  a  future  investigation  the  impact  of  poor  data  association 
and  sensor  biases  on  the  performance  of  the  target  selection  algorithm  presented  in  this  report. 

We  begin  in  Section  2  by  reviewing  the  sequential  Bayesian  formulation  of  the  single-target 
tracking  problem.  In  Section  2,  we  also  present  the  Renyi  a-entropy  and  ^-divergence  [13]  that  are 
extensively  utilized  in  our  information-theoretic  approach  (cf.  [9]  and  [10]).  The  bioinspired  target 
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selection  algorithm  for  the  general  Bayesian  formulation  is  presented  in  Section  3.  In  Section  1. 
we  present  the  specific  formulation  of  the  multiple-sensor  target  tracking  problem  for  systems 
dominated  by  additive  white  Gaussian  noise.  In  Section  5,  we  focus  on  targets  whose  motion  can 
be  described  by  a  deterministic  system  dynamic  model.  Deterministic  dynamics  allow  a  simple 
recursive  formulation  of  the  Cramer  Rao  lower  bound  on  the  error  covariance  associated  with  the 
estimated  target  state  [14].  In  Section  6.  we  use  the  Cramer  Rao  lower  bounds  derived  in  Section  5 
as  surrogates  for  target  state  estimation  error  covariances  to  evaluate  the  performance  of  tin*  target 
selection  algorithm  outlined  in  Section  3  for  an  example  scenario  involving  ballistic  targets.  Lastly, 
in  Section  7,  we  summarize  our  results  and  discuss  how  the  bioinspired  target  selection  algorithm 
may  be  further  improved  for  specific  applications. 
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2.  RELATIVE  INFORMATION  GAIN 


In  this  report,  we  adopt  an  information-theoretic  approach  to  target  select  ion  wherein  we  favor 
targets  whose  measurements,  once  collected  during  an  upcoming  observat  ion  period,  would  result  in 
the  largest  reduction  in  the  uncertainties  associated  with  their  state  estimates  (cf.  [5].  [9],  and  10]). 
We  begin  this  section  by  reviewing  the  recursive  Bayesian  formulation  of  the  single-target  tracking 
problem.  We  then  provide  general  expressions  for  the  inforination-theoretif  measures  utilized  by 
the  bioinspired  target  selection  logic  outlined  in  Section  3. 


2.1  SEQUENTIAL  BAYESIAN  ESTIMATION 


Within  the  Bayesian  framework,  we  seek  to  estimate  the  probability  density  function  of  a  tar¬ 
get's  state  vector,  xa-  €  R".  valid  at  time  /*.,  conditioned  on  a  sequence  of  noisy  measurements  [15]. 
Useful  statistical  measures,  such  as  the  mean  and  t he  covariance,  are  then  obtained  directly  from 
the  conditional  density.  We  limit  the  discussion  to  the  class  of  Markov  processes.  The  general 
nonlinear  system  dynamic  model  prescribing  the  temporal  evolution  of  the  target,  state  vector  from 
time  tft—i  to  time  tk  is  assumed  to  be  given  by 

Xfc  =  ¥>a  1  (Xfc  I,w*  i),  (!) 

where  w a-  \  is  a  random  vector  sampled  from  the  known  probability  density  function  p(\\'r  i)-  1  he 
general  nonlinear  observation  model  mapping  the  target  state  space  to  the  measurement  space  is. 
in  turn,  assumed  to  be  given  bv 

JS*  =  h*(x*.  V*),  (2) 

where  v^.  is  a  random  vector  sampled  from  the  known  probability  density  function  /;(va).  The 
noise  vectors  j  and  va*  in  (1)  and  (2).  respectively,  are  assumed  to  be  white  and  mutually 
independent . 


Given  a  sequence  of  measurements 


*1  k  =  {*1.22 . }  • 
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we  wish  to  estimate  the  posterior  conditional  density,  /> (xa- |zi  a- )•  of  the  target  state  vector,  xa, 
valid  at  time  t^.  Models  (1)  and  (2).  with  their  underlying  assumptions  of  Markovianity  and  white 
and  independent  noise  vectors,  render  a  recursive  formulat  ion.  Given  the  density,  p(x a-  \  \  z\-.k  \  ). 
of  the  target  state  vector  valid  at  time  t a-  i  conditioned  on  the  sequence*  of  measurements  collected 
up  to  time  i,  the  prior  conditional  density,  p (x^  Zi;a.  i  ).  of  the  target  state  vector  valid  at  time 
t a-  can  be  obtained  from  the  Chapman  Kolmogorov  equation: 

p(xk\zi.k-\)  =  I  P  (xa  |  X/,  1  )p(xk  1  [  Z1:a-  i)  tlxk  (1) 


where  p  (xa-  |  xa-  i  )  is  the  state  transit  ion  density  obtained  from  (1 ).  Ihe  posh  rior  density,  p  (xa  /-i.a-  )- 
conditioned  on  the  sequence  of  measurements  collected  up  to  time  U-  is.  in  turn,  obtained  from  the 
Bayes  rule: 


p(xk.  |  ZJ-A-) 


P  (ZA-  |  XA-  )  p  (xA-  I  Z] -A-  i) 
p{zk.  )  Z ,  a-  1  ) 


(5) 


(6) 


where  the  likelihood  function,  p( |x/.).  is  obtained  from  (2),  and 

P  (z*  |  Zl:fc-1  )  =  J  P  (z*  |  XA,.  )  J)  (xk  \  Zhk-  j  )  rfxfc. 

It  follows  that  start  ing  with  an  initial  probability  density  function,  p(xo).  of  the  target  state  vector. 
xq,  valid  at  a  time  to  prior  to  any  observation,  relations  (4)  and  (5)  can  be  applied  recursively  to 
update  the  conditional  probability  density  function  of  the  target  state  vector  its  new  measurements 
become  available. 


2.2  ENTROPY  AND  DIVERGENCE 


The  contribution  of  each  new  measurement  to  the  conditional  probability  density  function  of 
a  target’s  state  by  means  of  the  Bayes  rule  (5)  amounts  to  a  reduction  in  the  uncertainty  associated 
with  the  target’s  state  estimate.  A  reduction  in  uncertainty  can  be  equivalently  regarded  as  a  gain 
in  information.  The  Renyi  a-entropy  provides  a  general  scalar  measure  of  uncertainty  [10]: 

Ha  (zj  k)  =  T— - log  j  Vn  (x,,  I  Zi,k)  <lxk,  (7) 

1—0  J 

where  0  <  a  <  1.  In  information  theory,  “information”  is  defined  as  the  negative  of  entropy  [16]; 
that  is,  — Applying  L’Hopital’s  rule,  it  follows  that  as  a  approaches  unity,  the  Renyi 
o-entropy  (7)  reduces  to  the  Shannon  entropy: 


H  \  (z\:k)  =  lim  H« 

a->l 


/>(x*  I  Zj:^ )  log  p  {xk  |  z1:a-  )  dxk. 


(8) 


A  general  scalar  measure  of  the  relativre  information  between  the  prior  and  posterior  probability 
density  functions,  p  (xk  |  z\  k  j )  and  p  (xk  \  z^-k  ),  respectively,  is.  in  turn,  obtained  from  the  Renvi 
o-divergence  [10]: 


At  (zi.A-  ||  zi.A-  1  )  = 


a  -  1 


log 


P(xa-  |ziQ 

,/>(x*  |zi  A-  1  ) 


/a(xa-  |zi  a-  l)  dx A  . 


(9) 


Again,  applying  L’Hopital’s  rule,  it  follows  that  as  a  approaches  unity,  the  Renyi  o-divergence 
reduces  to  the  Knllback  Leibler  divergence : 


A  (Z):A-  ||zi:A-  i)  =  lim Va  (z1;A-  ||  Zl;fc  l)=  /  p(xfc|z1:*)log 
a  -*  1  J 

When  <v  =  1/2.  we  obtain  the  Hettinger  affinity: 


7>(xfr|Zi;*_i) 


dx,.. 


A/2  (zi-A •  II  Z1:A-  1 )  =  -2  log  J  \/p  (xa-  |  Zi;A- )  p  (xa-  |  zi  A,— i )  dxk, 


(10) 


(11) 


where  the  integral  on  the  right-hand  side  of  (11)  is  recognized  as  the  Bhattacharyya  coefficient. 

The  Renyi  o-divergence  is  a  measure  of  dissimilarity  between  two  distributions.  Although 
it  violates  the  symmetry  and  triangle  inequality  conditions,  it  is  nevertheless  helpful  to  think  of 
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the  Renyi  o-divergence  as  a  distance  function.  For  certain  special  cases,  the  Renyi  o-divergence 
can  in  fact  be  appropriately  transformed  to  yield  a  proper  distance  function.  For  example,  in  the 
case  of  a  =  1/2.  the  Hellinger  distance ,  which  satisfies  all  the1  conditions  required  to  be  a  proper 
mathematical  metric,  is  given  by 


(Zl :k  ||  Zl  k 


\  >  -  exp 

'Pi/2(*1  *  * 

a.  a  t 

2 

V 

(12) 


where  the  exponential  expression  under  the  square  root  sign  on  the  right -hand  side  of  (12)  is 
recognized  as  the  Bhattacharyya  distance. 

To  determine  on  which  target  a  sensor  should  collect  measurements  during  the  next  obser¬ 
vation  period,  we  adopt  the  general  strategy  pursued  by  [5]  and  [10]  by  favoring  the  target  whose 
expected  measurements  would  result  in  the  largest  gain  in  relative  informat  ion;  that  is,  we  select 
t  ho  target  whose  Renyi  a-divergence  would  be  the  largest.  The  choice  of  o  can  have  an  impact  on 
performance.  Values  of  a  =  1  and  a  =  1/2.  corresponding  to  the  Kullback  Leibler  divergence  and 
Hellinger  affinity,  respectively,  have  been  reported  to  provide  favorable.'  results  [10].  The  value  of 
o  =  1/2  appears  to  serve  as  a  better  discriminant  between  two  similar  densities  such  as  the  prior 
and  posterior  conditional  probability  density  functions  encountered  in  target  tracking  problems. 
The  choice  of  a  need  not  remain  constant  and  can  be  allowed  to  vary  adaptively.  The  prior  and 
posterior  conditional  probability  density  functions  of  newly  formed  tracks  mi  arguably  less  similar 
in  shape  than  the  prior  and  posterior  conditional  densities  of  mature  tracks,  thereby  suggesting 
values  of  a  closer  to  unity  for  newly  formed  tracks  and  values  of  a  closer  to  1/2  for  mature  t  racks. 
In  the  numerical  example  presented  in  Section  6.  we  show  results  for  o  1/2.  The  optimal  choice 
of  a  for  the  sensor  management  algorithm  outlined  in  this  report  will  bo  pursued  as  part  of  a  future 
study. 
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3.  DYNAMIC  TARGET  SELECTION 


The  decentralized  dynamic  target  selection  logic  we  present  is  predicated  upon  favoring  the 
target  whose  measurements,  once  collected  during  an  upcoming  observation  period,  would  provide 
the  largest  weighted  information  gain.  We  choose  the  Henyi  o-divergeuce  as  a  measure  of  relative 
information  gain.  The  weighting,  in  turn,  enables  the  local  sensor  to  learn  from  the  environment 
(here,  the  global  track  file)  the  contribution  of  the  other  sensors  with  which  it  has  no  direct  contact . 
Furthermore,  prior  to  maximization,  we  apply  a  set  of  preselection  rules  that  reduce  the  processing 
load.  Since  no  measurements  on  any  targets  are  available  prior  to  any  data  collection,  the  potential 
information  gain  is  obtained  by  computing  the  expected  value  of  the  Rem  i  a-divergence  with  respect 
to  the  probability  density  function  of  the  measurements  to  be  collected  during  the  forthcoming 
observation  period. 

In  order  to  expedite  the  formulation  of  the  decentralized  dynamic  target  selection  logic,  we 
introduce  some  new  notation  first.  We  denote  the  set  of  measurements  originating  From  the  /'tli 

target  (where  v  =  collected  by  the  /ith  sensor  (where  //  I . Ay)  within  the  time 

interval  [/*._!,£*)  with 


we  denote  the  set  of  all  measurements  originating  from  the  /zth  target  eolle<  ted  by  the  /itli  sensor 
up  to  (but  not  including)  time  t a  with 


(14) 


and  we  denote  the  set  of  all  measurements  originating  from  the  /oh  target  collected  by  all  sensors 
up  to  (but  not  including)  time  with 


(15) 


Our  bioinspired  target  selection  logic  applies  the  principles  of  self-organization  and  stigmergy  t  hat 
are  often  encountered  in  nature.  In  the  sequel,  we  give  detailed  accounts  of  how  wo  implement  each 
principle  within  the  context  of  sensor  management  . 

3.1  SELF-ORGANIZATION 

In  our  application,  sensors  following  simple  local  target  selection  rules  induce  the  multiple- 
sensor  target  tracking  system  to  arrive  at  a  global  database  of  estimated  target  state  vectors  that 
meet  their  desired  qualities  of  service.  Her  e,  we  consider  three  simple  rules.  We  begin  by  selecting 
targets  that  have  not  yet  met  their  qualities  of  service  (QoS)  at  time  /a  \: 


(Hi) 


If  §''  is  empty,  then  no  further  action  is  required  as  all  targets  have  already  met  their  desired 
qualities  of  service.  From  the  list  of  remaining  targets  (if  any),  we  then  select  those  that  would 


meet  their  qualities  of  service  if  new  measurements  were  collected  on  them  and  were  subsequently 
incorporated  by  the  //th  sensor  to  update  their  local  state  estimates: 


where 


§2  =  {"  |  *  €  s'\  iia  (zf,  z'',  ,)  <  QoS^  }  . 

K*  (z'r,  2l:fc-l)  =  J  P  (Zf  I  2l:fc  l  ) 


(17) 


(18) 


is  the  expected  entropy  of  the  posterior  probability  density  function,  and  p  (z^  j  denotes 

the  probability  density  function  of  the  measurements  originating  from  the  u th  target  expected  to 
be  collected  by  the  //th  sensor.  We  revert  to  Sj  if  §£  is  empty.  Lastly  from  the  list  of  desired 
targets  (if  any),  we  select  the  target  whose  measurements  would  yield  the  largest  gain  in  relative 
information: 

/;„  =  arg  max  ("A £")  ,  (19) 

where 

"Af  =  J  P  (*r  I  1  )  7^0  Zi;t_!  ||  *?:*_!)  dz£"  (20) 

is  the  expected  local  information  gain. 


3.2  STIGMERGY 

The  simple  local  rules  considered  thus  far  do  not  employ  any  learning.  Although  sensors  can¬ 
not  communicate  with  each  other  directly,  they  can  nevertheless  infer  the  impact  of  their  measure¬ 
ments  on  the  global  track  file  relative  to  those  of  other  sensors  by  comparing  their  local  information 
gain  achieved  for  a  particular  target  during  the  last  observation  period  with  the  global  information 
gain  that  was  achieved  for  the  same  target  by  the  multiple-sensor  system  as  a  whole.  If  the  local 
information  gain  achieved  by  the  //th  sensor  for  the  v th  target  is  the  same  or  nearly  the  same  as 
the  global  information  gain  on  the  z/th  target,  then  the  //th  sensor  has  been  the  major  contributor 
to  updating  the  global  state  estimate  of  the  z/th  target.  Under  this  circumstance,  the  //th  sensor 
should  continue  collecting  data  on  the  vi h  target  as  it  is  very  likely  that  the  other  sensors  are  unable 
to  either  observe  the  //th  target  or  have  been  obliged  by  their  local  rules  to  attend  to  other  targets. 
On  the  other  hand,  if  the  local  information  gain  achieved  by  the  //th  sensor  for  the  z/th  target  is 
significantly  smaller  than  the  global  information  gain  on  the  v\ h  target,  then  the  //th  sensor  could 
not  have  been  a  major  contributor  and  should  therefore  be  induced  to  collect  measurements  on 
some  other  target  in  urgent  need  of  attention. 

Inspired  by  biology,  the  global  track  file  serves  as  the  “environment”  in  our  application  and 
provides  an  indirect  method  of  communication  between  the  sensors.  To  include  the  environment's 
impact  on  the  target  selection  logic,  we  modify  the  local  rule  (19)  to  read  as 

/z„  =  arg  max  (wj^j  aA£")  ,  (21) 

where  the  weight,  with  0  <  tz/j^j  <  1,  serves  as  a  learning  index.  Assuming  that  all  targets 

yield  the  same  expected  information  gain  ftA^.  relation  (21)  implies  that  targets  with  weights 


in 


Figure  1.  Heuristic  June  turn  modeling  the  *; forgetting "  effect  in  the  learning  index,  j ,  when  the  last  local 
track  update  occurs  at  a  time  t[  \  earlier  than  the  last  global  track  update  firm,  j. 


closer  to  unity  will  be  favored  over  targets  with  weights  closer  to  zero.  When  then4  exists  a  local 
update  at  time  tg-  j,  we  choose  the  learning  index  to  be  given  by 


w 


\IV 

*-i 


a 


n 


\ltl/ 
^k  1 


(22) 


where 


o  \fiV  _ 
^k  1  “ 


Da  ( 


A11' 

A.k 


J\:k  2 


h.k 


-2) 


(2;i) 


is  the  actual  gain  in  local  information  obtained  by  the  //th  sensor  on  the  /'th  target  at  time  /;•  |. 
and 

,|Km)  (21) 

is  the  global  information  gain  for  the  lA  li  target  at  time  /*._  which  includes  measurements  collected 
by  all  sensors  (including  the  //tli  sensor).  If  ^Aj^  {  is  close  to  °A£  r  then  /e^  j  is  close  to  unity. 
Under  this  circumstance,  the  fith  sensor  is  perceived  as  a  major  contributor  to  updating  the  global 
state  estimate  of  the  // th  target  and  thus  favors  to  continue  collecting  data  on  the  v th  target.  Oil 
the  other  hand,  if  ^A^  x  is  significantly  smaller  than  aA^j,  then  the  //th  sensor  could  not  have 
been  a  major  contributor  to  updating  the  global  state  estimate  of  the  1  th  target,  and  thus  should 
attend  to  other  targets. 


When  the  last  local  update  occurred  at  some  time  If  \  <  U-  \.  we  choose  the  following 
heuristic  relation: 


[1  CXJ  ** 

-1  -  U  1 

\1 

1  (  xp  1 

T 

)\ 

(2a) 


where  r  is  a  globally  specified  time  constant  determining  the  rate  at  which  the  learning  index 
approaches  unity  thereby  inducing  a  “forgetting”  effect  when  there  are  no  local  measurement 
updates  (sec  Figure  1), 
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Figure  2.  Data  flow  diagram  for  the  distributed  sensor  management  architecture  for  problems  with  general 
system  dynamic  and  observation  models. 


3.3  DATA  FLOW 


The  data  flow  diagram  for  the  distributed  sensor  management  architecture  presented  in  this 
report  is  shown  in  Figure  2.  In  this  architecture,  autonomous  sensors  are  responsible  for  allocating 
their  own  data  collection  resources  based  on  (1)  the  information  available  in  the  global  track  file 
(GTF)  and  (2)  the  expected  relative  information  to  be  gained  during  the  forthcoming  data  collection 
time  interval.  /,*  i  <  t  <  Ik*  The  local  processing  cycle  is  detailed  in  the  box  in  the  upper  right 
corner  of  Figure  2.  While  sensor  management  occurs  strictly  locally  at  the  sensor  level,  track 
maintenance  occurs  globally  at  the  central  processing  node.  In  other  words,  our  architecture  does 
not  require  maintenance  of  local  track  files  The  global  track  file  is  populated  and  maintained  only 
with  reported  likelihoods  valid  during  the  data  collection  time  interval,  tk  i  <  t  <  tk . 

As  discussed  earlier,  in  this  report  we  assume  perfect  data  association;  that  is,  we  assume 
there  is  no  ambiguity  as  to  which  measurement  is  originated  from  which  target.  In  dense  target 
environments,  data  association  imposes  a  serious  challenge  and  cannot  be  ignored.  Although  not 
pursued  any  further  in  this  report,  in  our  sensor  management  architecture,  data  association  must 


12 


he  performed  both  at  the  local  and  global  levels.  At  the  global  level,  the  measurement  likelihoods 
reported  from  multiple  sensors  must  he  correlated  with  propagated  global  t  racks  prior  to  fusion. 
At  the  sensor  level  (see  top-right  box  in  Figure  2).  once  measurements  have  been  collected,  they 
must  be  associated  with  the  propagated  copies  of  the  global  tracks.  For  the  scenario  shown  on 
the  left-hand  side  of  Figure  2.  each  sensor  tracks  only  one  target  at  a  time.  For  example,  sensor  1 
has  allocated  its  resources  to  collect  measurements  solely  on  target  2.  In  this  case,  local  data 
association  would  not  be  necessary.  However,  if  target  2  in  Figure  2  represented  a  dense  target 
complex  instead  (that  is.  a  family  of  closely-spaced  objects  as  opposed  to  a  single  object),  then 
the  multiple  measurements  originating  from  the  complex  must  be  associated  with  the  propagated 
copies  of  the  global  tracks  that  form  the  cluster. 
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4.  ADDITIVE  WHITE  GAUSSIAN  NOISE 


Although  somewhat  specialized,  the  sources  of  system  dynamic  and  measurement  uncertain¬ 
ties  in  many  target  tracking  problems  are  often  modeled  to  be  additive,  white,  and  Gaussian.  Such 
modeling  has  the  advantage  of  allowing  the  computation  of  the  prior  and  posterior  conditional 
probability  density  functions  needed  for  the  sequential  Bayesian  estimation  of  the  target  state  to 
be  reduced  to  the  computation  of  their  first  two  statistical  moments  mean  and  covariance. 


4.1  SEQUENTIAL  BAYESIAN  ESTIMATION 


Under  the  additive  white  Gaussian  noise  assumption,  the  nonlinear  system  dynamic  model  ( 1) 

Xfc  =  <Pk  i  (x£  l)  +  (26) 


for  the  v th  target  reduces  to 


where' 


Wfr ~  A' (wA. t&:  ,), 


(27) 


is  a  white  zero-mean  Gaussian  random  vector  with  covariance  The  function  A^x;//,  X)  — 

—  4(x  —  //)  ^  X  ](x  //)  denotes  the  probability  density  function  of  a  Gaussian  ran 


^rxpL 2' 

dom  variable,  x,  with  mean  //  and  covariance  X.  Similarly,  under  the  additive'  white  Gaussian  noise 
assumption,  the  nonlinear  observation  model  (2)  for  the  //th  sensor  and  the  //th  target  reduces  to 


<  =  K  (xS)  +  v 


\w 


(28) 


where 

v':'-~.V(v,,O.Rn  (29) 

is  a  white  zero-mean  Gaussian  random  vector  with  covariance  Ufk.  . 

Since  the  process  noise  vector,  w £  r  is  additive,  white,  and  Gaussian,  it  follows  from  (26) 
that  the  state  transition  probability  density  function  for  the  state  vector  of  the  iA\\  target  from 
time  th  i  to  time  tk  is  given  by 

P  K  I  i )  =  (x£:  ,  (x£  ,)  Qi'  i )  •  (30) 

Starting  w  ith  a  Gaussian  probability  density  function  for  the  //th  target  s  st  ate  vector.  xjj  { .  valid 
at  time  tk- 1*  conditioned  on  the  sequence  of  measurements,  z\  /,  \: 

p(xfc  i  I  zi  :fc  i)=Ar(x^  x£  |A,  ,,Pj;  vk  ,).  (31) 

multiplying  (30)  and  (31 ).  employing  a  first-order  Taylor  expansion  of  t  ho  system  dynamic  function, 
<^/Ll(‘)'  about  the  state  estimate.  x£  [k,  t.  and  integrating  the  result  with  respect  to  x£  t  over 
Rn,  it  follows  that  the  prior  conditional  probability  density  function  of  the  state  vector  of  the  i't h 
target  valid  at  time  //  can  be  approximated  with  a  Gaussian  funct  ion: 


)  -  ^  ( 


v'  P" 

Xk'Xkk 


) 


(32) 


where  x^.  {  and  1  denote  the  predicted  state  estimate  and  error  covariance  of  the  uih  target  , 
respectively.  It  can  be  shown  that  they  are  given  by  [17] 


and 


where 


1 

9- 

II 

i 

-Ul 

1 

(33) 

p*-n* 

-1  (^*-l)  +  Qa  1’ 

(34) 

1  (XA:-1|^  l) 

/or  \ 

_ 

dx-k-i 

(3o) 

is  the  state  transition  Jacobian  matrix. 


Similarly,  since  the  measurement  noise  vector.  v^L \  is  additive,  white,  and  Gaussian,  it  follows 
from  (28)  that  the  likelihood  function  associated  with  the  measurement  vector,  originating  from 
the  v tli  target  and  collected  by  the  /vth  sensor  is  given  by 

pel 

where 

=  k  (xfcik-i)  (37) 

denotes  the  predicted  measurement  vector.  Multiplying  (32)  and  (36).  employing  a  first  order 
Taylor  expansion  of  the  measurement  function.  h£(*),  about  the  predicted  state  estimate, 
and  integrating  the  result  with  respect  to  x£  over  IRn,  it  follows  that 


where 


p  |  z i-f. )  ~  Af  ;  z^,  Si  ^  , 

sr=^pjl*.i(«)T+Rr 


is  the  innovation  covariance  matrix,  and 


nr 


(38) 


(39) 


(40) 


is  the  measurement  Jacobian  matrix  evaluated  at  In  (39),  denotes  the  anticipated  error 

covariance  matrix  associated  with  the  measurement  originating  from  the  lA h  target  to  be  collected 
by  the  /ith  sensor  at  time  t^.  We  assume  that  it  is  possible  to  compute  the  measurement  error 
covariance  matrix  associated  with  a  given  target  prior  to  its  observation. 

Substituting  expressions  (32),  (36),  and  (39)  into  (5),  it  can  be  shown  that  the  posterior 
conditional  probability  density  function  can  also  be  approximated  with  a  Gaussian  function: 


p( 


x/c 


(41) 
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where  xj/^.  and  are  the  updated  state  estimate  and  error  covariance,  respectively,  of  the  vlU 
target  obtained  using  the  measurement  collected  by  the  //tli  sensor  at  time  /*..  It  can  be  shown 
that  they  are  given  by  [17] 


*k\k  ~ 

(pt«) 1 

i - 1 

13 

?r 

'**1.-1 +  ("*)' K)  <' 

( 12) 

and 

i  •  i 

(pi«) 1  - 

(p« .) 

+  (Hi‘)  (nr)  n£. 

(43) 

Equations  (42)  and  (43) 

can  be  rewritten  as 

x> 

Pr  ^ 

II 

4  i  + K 

r(<"-  «£**,* ,) 

(14) 

and 

1  k  k 

=  P*|*  1 

Kj-sr  (kdt  . 

(4r>) 

respectively,  where 

*r = p«*-i  (h';)t  (sr) 1  m 

is  the  Kalman  gain  matrix  [17].  Equations  (44)  and  (45)  are  recognized,  respectively,  as  the  state 
estimate  and  error  covariance  update  equations  of  the  extended  Kalman  filter.  I  he  state  estimate 
and  error  covariance  prediction  equations  of  the  extended  Kalman  Iilter.  in  turn,  are  given  In¬ 
equations  (33)  and  (34).  respectively. 

The  data  flow  diagram  for  the  distributed  sensor  management  architecture  with  system  dy¬ 
namic  and  observation  models  governed  by  additive  white  Gaussian  noise  is  shown  in  Figure  3. 
The  data  flow  is  similar  to  that  shown  in  Figure  2  for  the  general  system  dynamic  and  observation 
models  with  the  exception  that  now  only  measurement  vectors  and  their  associated  measurement 
error  covariances  need  be  reported.  Compared  to  likelihoods,  this  is  a  significant  reduction  in  the 
amount  of  information  that  the  sensors  need  to  report.  As  shown  in  Figure'  3.  in  eases  where 
there  may  be  limited  communication  bandwidth,  optionally,  data  can  be  further  compressed.  For 
example,  given  a  sequence  of  measurements  collected  over  the  time  interval  \  <  /  <  for 
certain  scenarios,  it  may  be  sufficient  to  report  the  parameter  vector  (along  with  its  associated 
estimation  error  covariance)  representing  the  (it  to  a  polynomial  of  an  appropriately  chosen  degree. 
The  measurement  vectors  and  their  associated  error  covariances  (or  their  compressed  versions)  arc' 
conflated  and  reported  as  a  data  bundle,  denoted  with  sj/r  for  the  fit h  sensor  and  the  /H h  target  in 
Figure  3: 
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Figure  3.  Data  flow  diagram  for  the  distributed  sensor  management  architecture  for  problems  with  system 
dynamic  and  observation  models  governed  by  additive  white  Gaussian  noise. 


4.2  ENTROPY  AND  DIVERGENCE 


We  next  present  expressions  for  the  entropy  and  divergence.  Substituting  (41)  into  (7)  and 
evaluating  the  integral  analytically,  it  can  be  shown  that  the  posterior  o-entropy  is  given  by  [5] 


Hn  (S 


r'Uk 


i)  =  ^log( 


(50) 


The  expected  posterior  entropy  needed  for  the  dynamic  target  selection  algorithm  outlined  in  Sec¬ 
tion  3  is,  in  turn,  obtained  from  substituting  (50)  and  (38)  into  (18)  and  evaluating  the  integral 
analytically.  It  can  be  readily  shown  that  it  is  given  by 
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which  is  the  same  as  (50).  Similarly,  substituting  (41)  into  (9)  and  evaluating  the  integral  analyti 
cally.  it  can  be  shown  that  the  local  information  gain  is  given  by  [10] 
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The  expected  local  information  gain  needed  for  the  target  selection  algorit  Inn  out  lined  in  Sect  ion  3  is. 
in  turn,  obtained  from  substituting  (52)  and  (38)  into  (20)  and  evaluating  the  integral  analytically. 
It  can  be  shown  that  it  is  given  by 
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Using  LTlopitaPs  rule,  it  can  be  shown  that  as  a  approaches  unity,  equations  (50)  and  (51)  reduce 
to 
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and 
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respectively;  equations  (52)  and  (53),  in  turn  reduce  to 
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respectively.  In  (50)  and  (57).  In  denotes  the  n  x  n  identity  matrix  where  n  dim (x)  is  the 
dimension  of  the  target  state  vector. 


When  a  =  1/2.  equations  (50)  and  (51)  reduce  to 
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respectively;  equations  (52)  and  (53).  in  turn,  reduce  to 
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respectively. 
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5.  DETERMINISTIC  MOTION 


In  the  absence  of  process  noise — that  is.  when  the  process  noise  vector  w*.  ].  in  the  system 
dynamic  model  (1)  can  be  set  equal  to  zero-  target  motion  becomes  deterministic.  For  example, 
in  the  special  case  of  tracking  ballistic  targets,  target  motion  is  prescribed  by  well-understood 
forces  that  can  be  modeled  to  a  high  degree  of  accuracy  (e.g..  see  18.  cli.  3]).  Thus,  for  ballistic 
targets,  motion  can  be  safely  assumed  to  be  deterministic.  When  tracking  Sf  targets  following 
deterministic  trajectories,  the  nonlinear  system  dynamic  model  prescribing  the  temporal  evolution 
of  the  vih  target  from  time  to  time  tk  becomes 

4  =  (62) 

where 

, «  i)  =  ,+  /‘  f rw)dt  m 

Jtk-i 

is  a  deterministic  function  prescribing  the  noise-free  transition  of  the  state  vector  ot  the  // tli  target 
from  time  tk-\  to  time  t*.  The  function  f/^-)  in  (63)  models  t  he  underlying  continuous-time  forces 
acting  on  the  z/th  target;  that  is. 

*t  (64) 

where  x£  denotes  the  first  derivative  of  x}'  with  respect  to  time1.  Mere  we  use  the  subscript  /  to 
distinguish  continuous-time  variables  from  discrete-time  variables,  which  in  turn  we  denote  with  the 
subscript  k  for  time  index ;  thus,  using  this  notation,  x*  =  x/A .  Applying  (62)  and  (63)  recursively, 
it  follows  that  the  state  vector,  x£,  of  the  // th  target,  valid  at  time  /*.  can  be  obtained  directly 
from  the  target  state  vector.  x{J,  valid  at  a  time  to  prior  to  any  data  collection: 

x£  =  x<U  r  VW)<IL  (65) 

dto 

Thus,  in  case  of  deterministic  motion,  target  dynamics  are  completely  characterized  by  the  initial 
state  vector,  x|^,  valid  at  time  Iq. 


In  the  absence  of  process  noise,  the  covariance  Qjr  |  in  equation  (3  1).  which  we  use  to 
propagate  the  error  covariance  of  the  /zth  target  's  error  covariance  bet  ween  measurements.  vanishes. 


Hence,  starting  with  the  updated  global  state  estimate.  x£ 


A  l  A*  l 


and 


error  covariance 


i|A  l* 


it  follows  that  for  a  set  of  Mj!  measurements  expected  to  be  collected  In  the  //tli  sensor  on  the  tA h 
target  within  the  forthcoming  observation  period,  tk  i  <  /  <  tk.  the  covariance  propagation  and 
update  equations  equations  (31)  and  (43).  respectively  can  be  combined  and  applied  recursively 
to  vield 


(pm)  '  -  (n-nt- .) 


-1 


-f 


A ft1 


£  K  >)]  (<")  Hf,_  (<  „  ,)*-(«,. .1,  ,). 


r=i 


(66) 


1  Assuming  that  the  function  f  has  continuous  first  partial  derivatives,  it  can  be  shown  that  the  transition  nui 
trix  (35)  can  be  obtained  from  solving  CI*  F4>.  where  F  (9f/r)x  (e.g..  see  19  ). 
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where  t?fi .  with  tk-i  <  tth  <  denotes  the  time  of  a  particular  measurement  expected  to  be 
collected  by  the  (Jth  sensor  within  the  upcoming  observation  period.  Thus,  the  anticipated  Mk 
measurements  to  be  collected  by  the  //th  sensor  on  the  i^th  target  during  the  forthcoming  observation 
period,  1  <  t  <  can  potentially  produce  the  local  gain: 


A.I  k 


=  E  K  (**  w  .) ■)]  «,)  ‘ K„  (*J  .1,- .)  i)  <07> 

in  information  relative  to  the  inverse  of  the  global  error  covariance.  Pk  [k  valid  at  time  tk-i- 

In  other  words,  equation  (66)  can  be  rewritten  as 


4-  A 

Jfc| k  -  Jk  llfc-i  +  k  ' 


(68) 


where  =  (p[.  and  ^  L  —  {^k-ik  i)  are  ^ie  Fisher  information  matrices  as¬ 
sociated  with  the  local  and  global  updated  error  covariances  valid  at  times  and  tk~ i?  respec¬ 
tively.  Thus,  expression  (67)  provides  the  expected  gain  in  the  Fisher  information  matrix  at  time 
tig.  Once  actual  measurenients  have  been  collected,  the  predicted  measurement  error  covariances. 


fils 


r;;  ,  in  (67)  are  replaced  with  actual  measurement  error  covariances,  •  in  order  to  compute 

4  Ul/  t  M17 


the  actual  Fisher  information  matrix;  that  is.  at  that  point,  AJ^.  and  J^./.  in  (68)  are  replaced 

>k  cmivi  ^ k  k%  respectively. 


with  their  actual  values,  denoted  by  AT/'  and  j£n 


5.1  CRAMER  RAO  LOWER  BOUNDS 


Since  target  dynamics  in  the  absence  of  process  noise  are  completely  characterized  by  the 
state,  xo,  valid  at  a  particular  time,  to,  we  only  need  to  focus  on  estimating  xo-  The  state  estimate 
valid  at  an  any  other  time  can  subsequently  be  obtained  from  (62).  The  uncertainty  in  the  state 
estimate  valid  at  time  to  reduces  as  we  incorporate  new  measurements.  We  can  express  the  recursion 
relation  (68)  in  terms  of  Xq: 

jf(*S)=JS;  !(*&)  + (69) 

where  and  now  denote  the  predicted  local  and  actual  global  Fisher  information  matrices 

associated  with  the  estimates  of  Xq  at  times  t k  and  tk-[.  respectively.  Specifically,  the  local  gain 
in  the  expected  Fisher  information  matrix.  AJfc  ,  is  given  by 


K 


Ajfw)  =  E  K  '  (K)  K.  (*5) <o)- 

e=i 


(70) 


Again,  once  actual  measurements  have  been  collected  and  incorporated,  J^,  AJ^,  and  R)  in  (69) 
and  (70)  are  replaced  with  Jj^.  AJj^,  and  R^ .  respectively. 

When  the  estimated  state.  Xq.  of  the  vth  target  in  (69)  and  (70)  is  replaced  with  the  true  state. 
Xq,  the  inverse  of  the  updated  Fisher  information  matrix  provides  the  Cramer-Rao  lower  bound 
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on  the  error  covariance  [14].  When  computing  Cramer  Rao  lower  bounds,  the  terms  involving 
k  ^  in  the  expressions  for  the  o-divergence  that  were  presented  in  Section  1  all  vanish. 

Subsequently,  the  expected  value  of  the  o-divergence.  aA^.  becomes  the  same  as  the  actual  o- 
divergence.  aA^.  arid  therefore  and  AJ^  become  the  same  as  Jj, l>  and  AJj^.  respectively. 
Furthermore,  the  error  covariances.  P^.  and  Pj^.  j  in  the  expressions  foi  the  o-entropv  and  o- 

divergence  can  be  replaced  with  (Jj^)  1  and  (Jj£  t)  1 .  respectively. 

Without  committing  to  a  specific  tracking  filter,  we  can  evaluate  the  overall  performance  of 
the  target  selection  logic  outlined  in  Section  3  using  the  Cramer  Rao  lower  bounds  derived  in  this 
section2  (cf.  [21]).  Once  a  particular  sensor  selects  to  collect  measurements  on  a  particular  target, 
the  local  Fisher  information  matrix  is  computed  according  to 


h  '  A*  i 


+  A.l  A 


(71) 


where  denotes  the  index  of  the  target  selected  by  the  // ill  sensor.  The  global  Fisher  information 
matrix  is.  in  turn  updated  according  to 


V'  —  Ty 


AJ 


i 

h 


(72) 


For  each  local  measurement  cycle,  the  //th  sensor  begins  the  target  selection  process  by  accessing 
the  global  track  file  that  is  broadcast  to  all  sensors  at  regular  time  intervals.  The  sensor's  local 
processor  then  selects  only  those  targets  that  are  predicted  to  be  within  its  held  of  regard  during 
the  forthcoming  observation  period,  tk-i  <  t  <  Next,  the  local  processor  updates  the  learning 
indices  of  all  visible  targets  according  t  o  (22)  or  (25)  depending  on  whether  or  not  a  particular  target 
track  was  locally  updated  during  the  last  observation  period.  Subsequently,  the  local  processor 
selects  a  target  according  to  the  rules  outlined  in  Section  3.  Once  a  target  has  been  selected,  the 
local  gain.  AJj^,  for  the  selected  target  is  reported  to  the  central  processor  1  lie  central  processor, 
in  turn,  collects  similar  information  from  all  sensors  and  updates  the  global  track  file  according 
to  (72). 


Tor  ballistic  targets,  it  lias  been  shown  that  the  batch  least-squares  filter  provides  elfi<  ieut  stale  estimates  with 
error  covariances  that  match  the  Cramer  Rao  lower  bound  (e.g.,  see  [20]). 
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6.  NUMERICAL  RESULTS 


Wo  illustrate  the  performance  of  the  bioinspired  target  selection  algorithm  outlined  in  Sec¬ 
tion  3  using  simulation.  Twenty-one  different  scenarios  are  examined,  with  t  arget  numbers  ranging 
from  1  to  100.  Our  example  scenarios  utilize  t  hree  independently  operating  narrowband  IR  cameras 
on  board  three  separate  stationary  platforms,  each  of  which  can  track  up  to  100  ballistic  targets. 
Each  camera  is  allowed  to  pan  up  to  360  degrees  in  the  horizontal  direction,  whereas  the  panning 
movement  in  the  vertical  direction  is  restricted  to  lie  within  0  and  40  degrees.  The  ballistic  tar¬ 
gets  originate  at  the  same  time.  £o  —  0,  from  multiple  sources  randomly  distributed  over  a  fixed 
rectangular  area  and  terminate  at  the  same  location.  The  targets'  times  of  flight  are  chosen  to 
be  proportional  to  the  ground  range  between  their  points  of  origin  and  the  final  destination;  thus, 
different  targets  arrive  at  different  times.  The  wide  spatial  separation  of  the  targets  together  with 
the  sensors1  narrow  fields  of  view  guarantee  that  at  most  one  target  can  be  tracked  by  a  sensor  at 
any  given  time  thus,  eliminating  the  need  for  any  local  data  association  algorithm.  However,  the 
problem  of  global  data  association  remains.  To  compute  the  entropies  and  divergences  needed  by 
the  target  selection  algorithm  outlined  in  Section  3.  we  employ  the  Cramer  Rao  lower  bounds  of 
Section  5  as  surrogates  for  the  error  covariances  associated  with  the  state  estimates  of  the  ballistic 
targets  valid  at  time  Iq  =  0. 

At  regular  time  Intervals,  a  central  processor  collects  data  reported  by  each  IR  camera  here, 
in  the  form  of  AJ^"  discussed  in  Section  5  and  updates  the  global  track  file  using  equation  (72). 
The  global  track  file,  the  contents  of  which  are  broadcast  to  all  sensors  at  regular  time  intervals,  is 
assumed  to  have  been  already  populated  with  the  initial  values  of  all  targets'  Fisher  information 
matrices  for  example,  obtained  by  means  of  a  separate  set  of  sensors  operating  at  a  previous 
processing  epoch  -prior  to  the  targets'  entries  into  the  fields  of  regard  of  any  of  the  three  IR 
cameras.  We  do  not  account  for  any  search  or  acquisition  time;  thus,  once  a  sensor  has  tasked  itself 
to  collect  data  on  a  target  different  than  the  one  currently  being  tracked,  local  target -to-target 
switching  is  assumed  to  occur  instantaneously3. 

We  vary  the  complexity  of  the  resource  allocation  problem  by  starting  with  a  single  target 

and  subsequently  increasing  the  number  of  targets,  in  steps  of  5.  up  to  100  (i.e..  1.  5.  10.  15 . 95. 

100  targets).  Each  numerical  experiment  is  repeated  10  times,  and  results  are  averaged  to  evaluate 
performance.  The  simulation  parameters  are  listed  in  Table  1.  The  global-track-file  update  occurs 
every  2  seconds;  thus,  each  sensor,  with  a  measurement  sampling  time  of  1  second,  can  potentially 
produce  up  to  two  measurements  per  target  (i.e..  Mf!  =  2).  Our  goal  is  to  ensure  that  for  as 
many  targets  as  possible  the  state  estimates  meet  a  particular  level  of  accnrac  >  as  quantified  by  t  he 
a-entropy.  Targets  whose  a-entropies  fall  below  a  certain  threshold  arc*  referred  to  as  “serviced1’; 
the  threshold  itself  is  referred  to  as  the  "quality  of  service.’1  We  have  chosen  a  =  1/2.  in  which 
case  the  a-divergence  becomes  the  Hellinger  affinity  (see  Section  2).  Wo  defer  the  examination  of 
the  impact  of  other  values  of  a  on  system  performance  to  a  future  study.  Since  no  process  noise  is 
included  in  the  problem,  the  value  of  the  global  entropy  for  a  particular  target  does  not  increase 
once  a  sufficient  number  of  measurements  have  been  in<  orporated  to  lower  it  below  a  certain  level. 


’Although  not  pursued  here,  one  can  readily  account  for  the  time  lost  in  acquiring  a  given  target  by  excluding  the 
measurements  that  would  otherwise  have  been  included  in  the  sum  (70). 


TABLE  1.  Simulation  Parameters 


Parameter 

Value 

Maximum  Number  of  Targets 

100 

Number  of  Sensors 

3 

Local  Sampling  Time 

1  s 

Global  Sampling  Time 

2  s 

Measurement  Noise  Standard  Deviation 

500  //rad 

Quality  of  Service 

25  log  (rn6/s3) 

Forgetting  Time  Constant,  r 

10  s 

Number  of  Monte  Carlo  Trials 

10 

The  average  number  of  targets  serviced  as  a  function  of  the  total  number  of  targets  in  a 
given  scenario  is  shown  in  Figure  4.  Averages  are  computed  over  10  Monte  Carlo  trials.  The 
launch  locations  of  the  ballistic:  targets  are  randomly  varied  over  a  rectangular  region  from  one 
Monte  Carlo  trial  to  the  next:  however,  in  all  trials,  all  targets  terminate  at  the  same  location. 
The  locations  of  the  three  sensor  platforms  also  remain  the  same  throughout  the  simulation.  The 
dashed  line  in  Figure  4  corresponds  to  the  ideal  case  when  all  t  arget  s  meet  their  qualities  of  service 
at  all  times.  However,  as  the  number  of  targets  increases,  degradation  in  performance  would  be 
expected.  The  green  curve  labeled  as  “bio  inspired”  in  Figure  4  is  obtained  by  employing  the  target 
selection  algorithm  outlined  in  Section  3.  VVe  see  that  the  bioinspired  algorithm  performs  very  well. 
We  only  observe  a  relatively  small  degradation  in  performance  when  the  number  of  targets  nears 
100. 


For  reference,  we  also  compare  the  performance  of  the  bioinspired  algorithm  with  two  other 
target  selection  algorithms.  I  he  red  curve  labeled  as  “uniformly  random”  in  Figure  4  is  similar  to 
the  bioinspired  algorithm,  a  decentralized  sensor-level  target  selection  algorithm.  This  algorithm 
randomly  selects  a  target  from  a  list  of  targets  predicted  to  be  visible  within  the  upcoming  ob- 
servation  period.  The  targets  are  assigned  equal  weights;  thus,  the  random  selection  is  uniform. 
As  is  evident  from  Figure  4.  for  small  number  of  targets  (up  to  around  25  in  our  example),  the 
performance  of  the  uniformly  random  algorithm  is  nearly  perfect — as  is  indeed  the  case  for  the 
other  algorithms.  This  result  suggests  that  at  least  for  a  small  number  of  targets,  an  algorithm 
as  simple  as  the  uniformly  random  target  selection  algorithm  can  be  employed  reliably  to  meet  all 
resource  allocation  requirements,  and  therefore,  there  may  be  no  need  to  resort  to  a  more  expensive 
solution. 

In  addition  to  the  decentralized,  uniformly  random  target  selection  algorithm,  we  also  compare 
the  performance  of  the  bioinspired  algorithm  to  that  of  a  greedy,  centrally  processed  sensor- to-target 
assignment  algorithm.  The  blue  curve  labeled  as  “greedy”  in  Figure  4  shows  results  obtained  using 
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Figure  f.  Average  number  of  targets  serviced  versus  actual  number  of  targets.  Averages  are  computed  over 
10  Monte  Carlo  trials.  See  text  for  an  explanation  of  the  different  curves 


t.hi.s  algorithm.  Our  centrally  processed  algorithm  assigns  sensors  to  targets  by  solving  a  linear 
assignment  problem.  The  centrally  processed  algorithm  is  “greedy"  in  that  at  any  time  interval. 
(1)  it  assigns  sequentially  one  sensor  per  target,  and  (2)  it  solves  the  assignment  problem  only  lor 
the  forthcoming  observation  period  without  any  regard  to  past  assignments  or  prediction  of  future 
performance.  The  former  restriction  prevents  the  simultaneous  coverage  of  targets  by  multiple 
sensors,  whereas  the  target  selection  algorithm  of  Section  3.  although  not  “consciously.”  does  allow 
for  simultaneous  multiple-sensor  coverage.  I  he  lack  of  simultaneous  multiple-sensor  coverage  over 
a  given  observation  period  in  the  greedy  algorithm  may  partially,  although  not  entirely,  explain  the 
poorer  performance  of  the  algorithm  for  a  larger  number  of  targets  (greater  than  around  GO  targets 
in  our  example).  We  would  expect  a  more  sophisticated  centralized  algorithm  that  fully  takes  takes 
into  account  the  stochastic  nature  of  the  scenario  to  perforin  at  least  as  good  as  or  better  than  the 
bioinspired  target  selection  algorithm.  However,  the  main  advantages  of  i  he  target  selection  logic  of 
Section  3  are:  (1)  it  is  scalable,  and  (2)  it  requires  less  processing  and  communication  bandwidth. 
Thus,  in  many  applications,  a  quick  cost-benefit  analysis  may  reveal  its  implementation  to  be  more 
advantageous  as  compared  to  an  otherwise  optimal,  centrally  processed  algorithm.  A  complete 
comparative  analysis  of  various  centralized  and  decentralized  algorithms  is  beyond  the  scope  of  this 
report  and  is  deferred  to  a  future  study. 

In  Figure  4,  the  average  number  of  targets  serviced  were  computed  v\heii  the  three  sensors' 
opportunities  to  observe  all  targets  had  been  exhausted.  For  a  subset,  of  scenarios  considered,  plots 
of  the  average  number  of  targets  serviced  as  a  function  of  time  are  shown  in  Figure  5.  In  addition 
to  the  three  curves  that  were  considered  in  Figure  4.  we  have  also  included  a  curve  labeled  as 
“serviceable”  in  Figure  5.  Since  not  all  targets  are  within  the  fields  of  regard  of  all  sensors  at  all 
times,  even  the  existence  of  an  optimal  resource  allocation  algorithm  could  not  guarantee  that  the 
quality  of  service  can  be  met  for  all  targets.  This  is  a  purely  geometrical  restriction.  Nevertheless, 
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(a)  15  Targets 


(c)  70  Targets 


Time 


(b)  45  Targets 


Time 

(d)  100  Targets 


Figure  5. 


Average  number  of  targets  serviced  versus  time.  See  text  for  an  explanation  of  the  different  curves. 


the  “serviceable”  curves  provide  a  loose  upper  bound  on  performance.  As  is  evident  in  Figure  5. 
it  is  not  after  around  400  s  that  eventually  all  targets  enter  the  fields  of  regard  of  all  sensors  in 
our  example.  From  the  temporal  behavior  shown  in  Figure  5,  it  is  clear  that  the  target  selection 
algorithm  outlined  in  Section  3  performs  better  than  the  other  two  algorithms  in  that  larger  number 
of  targets  meet  their  qualities  of  service  earlier.  The  superior  performance  of  the  bioinspired  target 
selection  algorithm  becomes  even  more  evident  as  the  number  of  targets  increases. 
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7.  SUMMARY  AND  DISCUSSION 


In  this  report,  we  presented  a  bioinspired  algorithm  for  managing  the  resources  of  multi 
pie  sensors  tracking  multiple  targets  within  the  framework  of  a  decentralized  sensor  management 
architecture.  Our  target  selection  logic  follows  the  principles  of  self-organization  and  stigmergy 
observed  in  biological  swarms.  In  our  implementation,  sensors  are  self-tasking  and  are  not  required 
to  communicate  with  each  other  directly.  Nevertheless,  sensors  can  infer  from  the  global  track 
file,  which  is  continuously  broadcast  by  a  central  processing  node,  their  relative  contributions  to 
achieving  certain  levels  of  global  state  estimation  accuracy  for  part  icular  targets. 

We  devised  local  information-theoretic  target  selection  rules  for  the  general  Bayesian  formu¬ 
lation  of  the  target  tracking  problem.  We  also  provided  specific  formulas  for  problems  dominated 
by  additive  white  Gaussian  noise.  We  rely  on  the  Renyi  o-entropy  and  o  -divergence  as  measures  of 
absolute  and  relative  information,  respectively.  We  have  not  examined  the  impact  of  the  particular 
choice  of  a.  In  our  examples,  following  [10].  we  have  chosen  or  =  1/2.  We  defer  to  a  future  study 
the  examination  of  the  impact  of  the  particular  choice  of  a  on  the  performance  of  the  bioinspired 
target  selection  algorithm. 

For  reference,  we  have  compared  the  performance  of  the  bioinspired  t  arget  selection  algorithm 
with  those  of  other  suboptirnal  algorithms  using  numerical  examples  involving  ballistic  targets  (no 
process  noise).  To  demonstrate  performance,  we  have  used  Cramer  Rao  lower  bounds  as  surrogates 
for  target  state  estimation  error  covariances.  Targets  following  deterministic  trajectories  such  as 
ballistic  targets  render  a  simple  recursive  formulat  ion  of  their  Cramer  Rao  lower  bounds.  We  have 
shown  that  the  bioinspired  algorithm  is  highly  scalable  and  performs  very  well  for  large  numbers  of 
targets.  Our  numerical  examples  were  devised  such  that  data  association  would  pose  no  challenge 
to  tracking  performance.  Poor  data  association  can  in  general  impact  sensor  management..  In  a 
future  study,  we  will  examine  the  impact  of  finite  process  noise,  poor  data  association,  and  sensor 
biases  on  the  performance  of  the  bioinspired  target  selection  algorithm. 

The  target  selection  rules  outlined  in  Section  are  by  no  means  the  only  rules  one  could  de¬ 
vise.  The  rules  could  be  extended,  and  the  bioinspired  target  selection  algorithm  may  be  optimized 
for  specific  applications.  For  example,  in  the  case  of  deterministic  dynamics,  such  as  the  example 
scenarios  considered  in  Section  G.  the  predictable  nature  of  target  motion  may  be  exploited  to 
improve  the  formulation  of  the  learning  index,  Some  knowledge  of  other  sensors'  observation 

capabilities,  if  available,  can  also  be  exploited  to  improve  the  formula!  ion  of  the  learning  index. 
Although  in  a  decentralized  sensor  management  architecture  sensors  are  not  required  to  communi¬ 
cate  with  each  other  directly,  certain  information  about  them,  such  as  their  locations  and  fields  of 
regard,  can  nevertheless  be  readily  broadcast  by  the  central  processing  nodi'  with  little  added  cost. 
We  defer  the  examination  of  such  extensions  of  the  algorithm  to  a  future  study. 
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